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ABSTRACT 

Measurements of clustering in large-scale imaging surveys that make use of photomet- 
ric redshifts depend on the uncertainties in the redshift determination. We have used 
light-cone simulations to show how the deprojection method successfully recovers the 
real space correlation function when applied to mock photometric redshift surveys. 
We study how the errors in the redshift determination affect the quality of the recov- 
ered two-point correlation function. Considering the expected errors associated to the 
planned photometric redshift surveys, we conclude that this method provides informa- 
tion on the clustering of matter useful for the estimation of cosmological parameters 
that depend on the large scale distribution of galaxies. 
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1 INTRODUCTION 

In recent years, photometric redshift surveys have been pro- 
posed as a way to extend large-scale structure studies to- 
wards higher redshifts than it is possible using spectroscopic 
surveys. These surveys observe a region of the sky through 
a number of filters, and use the photometry obtained to de- 
termine the redshifts, z, and spectral energy distributions 
(SED's) of galaxies. Using photometry instead of spectra al- 
lows them to get much deeper, but the uncertainty in the 
determination of redshifts is larger (Baum 1962; Koo 1986; 
Connolly et al. 1995; Blake & Bridle 2005; Fernandez-Soto 
et al. 2001). 

Two projects of this kind are COMBO- 17(Classifying 
Objects by Medium-Band Observations) and the ALHAM- 
BRA (Advanced Large, Homogeneous Area Medium Band 
Redshift Astronomical) survey. COMBO-17 (Wolf et al. 
2003) surveyed a total area of ~ 1 deg 2 using a combina- 
tion of 17 broad-band and medium-band filters. It provided 
photometric redshifts for ~ 25000 galaxies in 0.2 < z < 1.2, 
with a typical error of Az ~ 0.03. The ALHAMBRA sur- 
vey (Moles et al. 2006; Moles et al. 2008; Fernandez-Soto 
et al. 2008), currently ongoing, will observe a total area of 
~ 4 deg 2 , in 16 1° x 0?25 strips. It uses 20 medium-band, 
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equal- width filters covering the optical range, plus the stan- 
dard J, H, K s near-infrared filters. Moles et al. expect to 
obtain photometric redshifts for ~ 3 x 10 5 galaxies with 
Iab ^ 24.7 (60 per cent completeness level), z me d = 0.74, 
and Az ~ 0.015(1 + z). 

These surveys can therefore provide us with large and 
deep samples of galaxies. In order to study large-scale struc- 
ture using these samples, one has to deal with large red- 
shift errors. These redshift errors produce uncertainties in 
the determination of distances to the galaxies, and hence 
in their three-dimensional positions (Coe et al. 2006). This 
uncertainty has to be added to the one produced by pecu- 
liar motions of galaxies. The latter is important for spectro- 
scopic surveys, while the former dominates the uncertainties 
in photometric surveys. 

In the present work, we focus on the two-point correla- 
tion function, £(r). We study how it is affected by redshift 
errors, and describe a method to recover its real-space value 
from photometric redshift survey data. The method we use 
is based in measuring the two-dimensional correlation func- 
tion, £(<r, 7r) (where 7r is the line-of-sight separation, and a is 
the transverse separation), obtaining from it the projected 
correlation function, 2(<r), and deprojecting it. This method 
(outlined in Section 3) was first proposed by Davis & Pee- 
bles (1983) as a way to avoid the uncertainties due to pe- 
culiar velocities in spectroscopic surveys, and has been used 
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successfully in subsequent analyses (e.g. Saunders, Rowan- 
Robinson & Lawrence 1992; Hawkins et al., 2003; Madgwick 
et al., 2003; Zehavi et al., 2004, 2005a). 

Phleps et al. (2006) studied the correlation function of 
galaxies in COMBO-17. They used H(a) as a measure of 
real-space clustering, and compared it to the predictions of 
halo occupation models. However, they did not attempt to 
recover £ (r) from their data. 

We tested this method using data from the light-cone 
simulation of Heinamaki et al. (2005). From the simulation, 
we produced three mock photometric redshift catalogues, 
corresponding to different accuracies in the determination 
of redshifts. We then compared the correlation function £(r) 
obtained by our method in each case to the real-space one, 
computed from the original catalogue. 

We describe the simulation and the way in which we cre- 
ated the mock catalogues in Section 2. Section 3 describes 
our method to calculate £(r) from the simulated data. In 
Section 4 we present our results, and we summarize our con- 
clusions in Section 5. 



2 DATA USED 

The catalogues used in this paper come from the light- 
cone simulation of Heinamaki et al. (2005). They simu- 
lated the distribution of dark matter haloes in a light-cone 
covering 2° x 0?5 in the sky for a standard ACDM cos- 
mology (VL DM = 0.226, VL B = 0.044, fi A = 0.73, h = 
ffo/lOOkms^Mpc- 1 = 0.71, a 8 = 0.84). We used the full 
output catalogue of the simulation, which contains haloes 
with M > 7.14- 10 10 h' 1 Mq. We calculate the halo correla- 
tion function in our analysis. Its behaviour should be similar 
enough to the galaxy correlation function as to correctly as- 
sess the validity of our method. 

As in this work we are not interested in the evolution 
of the correlation function with redshift, we restricted our 
study to the redshift bin z £ [2,3]. The volume considered, 
in co- moving coordinates, is 864 /i -1 Mpc long in the line-of- 
sight direction, while its transverse section varies between 
130 x 32 /i -1 Mpc in its close end to 160 x 40ft~ 1 Mpc in 
its far end. The total volume is 4.56 x 10 6 h~ 3 Mpc 3 and it 
contains ~ 180000 haloes. We chose that redshift interval for 
two reasons. At lower redshifts, the light-cone is too narrow, 
while at higher redshifts it contains too few haloes. 

We generated three mock 'photometric redshift cata- 
logues', corresponding to surveys with redshift uncertainties 
Az/(l + z) = 0.05,0.015,0.005. The first case (Az/{l + z) = 
0.05) corresponds typically to a survey using ~ 5 broad-band 
filters (see e.g. Fernandez-Soto et al. 2001). Az/(1 + z) = 
0.015 corresponds to the value expected from the ALHAM- 
BRA survey (Moles et al. 2008). The last case, Az/(l + z) = 
0.005, would correspond to a future survey using even more 
filters. As an example, the PAU (Physics of the Accelerat- 
ing Universe) survey project (Bemtez et al. 2008) aims at 
obtaining photometric redshifts for Luminous Red Galaxies 
(LRG's) with uncertainties Az/(l + z) ~ 0.0035 for z < 0.9. 
As the uncertainty in photometric redshifts decreases for 
high redshift galaxies (z > 2.5), when the Lyman-a wave- 
length enters into the visible domain, it should also be pos- 
sible, in principle, to get such a small Az in this case. 

In creating our mock catalogues, we assumed Gaussian 



errors for the photometric redshifts. This is not generally 
the case for real surveys, due to the existence of a fraction 
of 'catastrophic' redshift determinations, and to the mix of 
different classes of objects with a variety of photometric 
redshift errors. However, our assumption of single-peaked 
Gaussian-distributed errors would be valid for a catalogue 
selected to contain only "good" redshifts. This catalogue 
could be built combining the selection of a given class of ob- 
jects (e.g. LRG's), with the use of some estimate of the red- 
shift determination quality. The latter could be the knowl- 
edge of the full redshift probability function (Fernandez- 
Soto et al. 2002), or the 'odds' parameter in the case of 
Bayesian methods (Bemtez 2000). Existing experience in- 
dicates that, depending on the survey design, it is possible 
to obtain "good" redshifts for objects down to magnitudes 
rreiim — 1 or TOn m — 2, where mn m is the limit magnitude of 
the survey. 

At the end of Section 4.2, we assess the robustness of 
our results to the presence of 'catastrophic' redshifts. We 
consider catalogues with a fraction of such outliers of 5 per 
cent. This is a conservative value, typical of broad-band, 
non-optimized photometric redshift surveys, and it should 
be significatively smaller in the case of "good" redshifts. 
As an example, Ilbert et al. (2008) compiled a photometric 
redshift catalogue for the COSMOS field, using 30 bands 
ranging from the ultraviolet to the mid-infrared. They ob- 
tained just 0.7 per cent of outliers when comparing their 
bright sample (i\ B < 22.5) to spectroscopic redshifts. 

To generate each mock catalogue, we modified the po- 
sition of each point in the simulation following these steps: 

(i) We calculated the 'cosmological redshift' of the object 
from its real-space position. 

(ii) We added to this 'cosmological redshift' the redshift 
due to the peculiar velocity of the object. These peculiar 
velocities of the haloes are provided by the simulation. 

(iii) To simulate the expected redshift errors, we added a 
random shift to the resulting redshift, following a Gaussian 
distribution with variance equal to Az in each case. The 
redshift obtained is the 'observed redshift' of the object. 

(iv) We finally obtained the three-dimensional position 
of the object corresponding to this 'observed redshift' and 
included it in the mock catalogue. 

This distortion process was carried out for all the points 
in the whole cone of the simulation. The selection of the 
points in the redshift bin z 6 [2, 3] was performed using 
the new 'observed redshifts', thus simulating the selection 
process in a real survey. 

Fig. 1 shows the distribution of haloes in the original 
catalogue and in the mock photometric catalogues. The Fig- 
ure shows the real space positions of haloes, not affected by 
peculiar velocities, and thus does not show the finger-of-God 
or coherent infall effects observed in spectroscopic surveys. 
Due to redshift errors, structures which are clearly seen in 
real space are smoothed and hardly recognizable in photo- 
metric redshift data. 



3 METHOD 

The most widely used method to measure the correlation 
function consists in comparing the distribution of points in 
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Figure 1. The distribution of haloes in the four catalogues used: the original real-space catalogue, and the three mock photometric 
catalogues. The distribution is shown projected on a longitudinal plane, and only 20 per cent of the points are shown, for clarity. 



the data catalogue with a random distribution of points gen- 
erated in the same volume. To make the comparison, one 
calculates the number of pairs with separation in the range 
[r, r + dr] between points in the data catalogue (DD(r)), be- 
tween points in the random catalogue (RR(r)), and between 
a point in the real catalogue and a point in the random cata- 
logue (DR(r)). The estimator used in this work to compute 
£(r) is (Landy & Szalay 1993): 



C(r) = l + 



(N R y DD(r) N R DR(r) 
\N D J RR(r) N D RR(r) 



(1) 



where Nd is the number of points in the data catalogue, and 
Nr is the number of points in the random catalogue. 

However, this method can not be used when the studied 
catalogue comes from a photometric survey. The large errors 
in redshift and hence in the line-of-sight positions produce 
two effects that have to be considered. On one side, these 
random shifts in position erase correlations between points, 
and hence £(r) measured according to (1) would be much 
lower than the real £(r). On the other side, as the shifts are 
only in the line-of-sight direction, isotropy of the distribu- 
tion is lost. Correlation is only lost along the longitudinal 
direction, but it is conserved in the transverse plane. 

The method we used to recover real-space £(r) from the 
mock photometric redshift catalogues is the same described 
in Davis & Peebles (1983) and Saunders et al. (1992) for 



spectroscopic surveys. It is based in the decomposition of 
pair separations in parallel and perpendicular distances (n 
and a, respectively). 

Let si and S2 be the measured positions (in 'observed 
redshift space') of two points in the catalogue. We then de- 
fine the separation vector, s = — si, and the line-of-sight 
vector, i = si + S2, of the pair. From these, we now define 
the parallel and perpendicular distances of the pair as 
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(2) 



Once we have defined ir and a for each pair of points, 
we can calculate the two-dimensional correlation function, 
f(cr, 7r) in an analogous way to equation (1), substituting 
the (r) dependence by (a, it). From £(cr, n), we define the 
projected correlation function as 



(a) = 2 f(a,7r)d7r. 



(3) 



As S depends only on a, and the angle between any pair of 
points is small, it will not be affected significantly by redshift 
errors, as these will mainly produce shifts in n. 

Assuming that the real-space distribution is isotropic, 
we can relate S to the real-space correlation function, £ r , as 



tr(r) 



rdr 



(r 2 — a 2 ) 



1/2 



(4) 
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This relation can be inverted, obtaining £ r in terms of H as 
the Abel integral: 



dH(a) 



do- 



do- ( cr 2_ r 2)l/2 



(5) 



Therefore, the method proposed to compute £(r) from 
photometric survey data consists of the following steps. We 
first obtain £(o, 7r) from counting pairs of points in the data 
and in the random catalogues. The projected correlation 
function, S(o), is then obtained by integration of equa- 
tion (3). Finally, the real-space correlation function, £(r), is 
calculated from equation (5). Some problems arise in the nu- 
merical integration of equations (3) and (5) . Both integrals 
extend formally to +oo. However, when computing them nu- 
merically, we have to set finite upper limits, 7r max and o max . 

In the first case, the value of 7r max should be large 
enough to include almost all the correlated pairs. However, 
if it is too large, this would introduce extra noise in the 
calculation. 

When integrating equation (5), the upper limit o max is 
fixed, for pencil-beam surveys, by the maximum transverse 
separation allowed by the geometry. The way we used to 
evaluate (5) was that of Saunders et al. (1992). We interpo- 
lated linearly E between its values in each a bin, and then 
integrated (5) analytically. Taking Ej as the value of S for 
the bin centred at o; , we have 



In 



Redshift errors will influence the result in two ways. 
First, these errors change the apparent line-of-sight direction 
l/\l\ (see equation (2)), and through that, the apparent line- 
of-sight distance n, and, most important, the perpendicular 
distance o. These errors grow with the redshift error and 
with the width of the galaxy pair. 

Another, and much stronger source of errors is the as- 
sumption that the apparent distance in redshift space is the 
real distance between two galaxies - this assumption is nec- 
essary to obtain our basic integral relation (4). In case of 
photometric errors, this assumption is hardly justified, but 
we will see that the inverted correlation functions are close 
to the real one, anyway. The errors caused by this assump- 
tion grow with the redshift errors. 



4 RESULTS 

We tested the deprojection method to measure £(r) using 
the mock photometric catalogues described in Section 2. We 
applied the method described above to data from the three 
mock catalogues, and obtained £dep(r) in each case. These 
were then compared to the real-space £ r (r) calculated from 
the undistorted simulation catalogue using equation (1) di- 
rectly. 

The value used for the integration limit in equation (5) 
was o max = 130 ft -1 Mpc. This is about 80 per cent of the 
maximum transverse separation allowed by the geometry of 
the light-cone. We used 32 bins in o, logarithmically spaced 
between 0.1 Mpc and o" max . 

We performed several tests to choose the appropriate 



value for 7r max , scaling it relative to the Az of the cata- 
logue considered. The £(r) recovered increased with ir max 
for values 7r max < 3Az, and converged to fixed values for 
larger 7r max . However, the noise also increased with 7r max . 
We adopted a conservative value of 7r max ~ 4A« for our cal- 
culations, in order to be sure to include all correlated pairs, 
and not introducing too much extra noise. 

To estimate the correlation function error and covari- 
ance between bins in r, we used the jackknife method (see 
e.g. Zehavi et al. 2005b). We divided our volume in 12 equal 
sub-volumes, and constructed our jackknife samples omit- 
ting one sub-volume at a time. We repeated the full cal- 
culation of £(r) for each of these samples. Denoting by $ 
the value of the correlation function obtained for bin i in 
jackknife sample k, the covariance matrix is then 



where is the average of the values obtained for bin i, and 
N = 12. 



4.1 Effect of redshift errors on £(<x, 77 ) 

As a first step in the calculation of £(r), we calculated £(ct, it) 
for each mock catalogue. The results are shown in Fig. 2. We 
also plot the £(o, n) obtained in the real-space catalogue, 
for comparison. Two effects of the redshift errors can be ob- 
served. First, correlation decreases with the value of Az for 
each catalogue. Also, there is a loss of symmetry of £(o, 7r) 
in these plots. In real space, due to the isotropy of the dis- 
tribution, £(cr, 7r) has circular symmetry (seen as a 'boxy' 
symmetry in the logarithmic scale used). However, we can 
see that when we calculate it for the mock photometric cat- 
alogues, the distribution gets stretched along the 7r axis. 



4.2 Tests of the deprojection method 

When comparing our results for £dep(r) with the real- 
space £ r (r), we restricted the analysis to the range r £ 
[0.5, 30] h,- 1 Mpc. The lower limit is given by the way haloes 
were selected in the simulation. They were selected using a 
friends-of- friends algorithm, therefore if we had two haloes 
at a too small separation, they would merge into a single 
halo (Heinamaki et al. 2005). This prevents us from mea- 
suring £(r) at such small distances. The upper limit was 
fixed because of the geometry of the light-cone. As the max- 
imum separation along the short transverse axis is between 
30- 40 h,- 1 Mpc, we can not trust our method beyond these 
scales. 

We measured £ r (r) directly from the real-space cata- 
logue. In order to compare the real-space result to the one 
obtained using our method in the mock photometric cata- 
logues, we fitted £ r (r) by a third-order spline. 

From £(o, n), we obtained the projected correlation 
function E(cr) for each of the mock photometric catalogues. 
In Fig. 3 we compare the function E(o) calculated for the 
mock catalogues to the function obtained from the spline fit 
to the real-space £r(r), according to equation (4). The results 
obtained for the Az/(l + z) = 0.005 and Az/(l + z) = 0.015 
catalogues follow closely the real-space result. In the case 
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Figure 2. The two-dimensional correlation function £(cr, n) obtained for the real space catalogue (top left), and for the mock photometric 
catalogues with Az = 0.005(1 + 2) (top right), Az = 0.015(1 + z) (bottom left) and Az = 0.05(1 + z) (bottom right). Contours are drawn 
at £ = 10, 3, 1, 0.3, 0.1, 0.03, 0.01, 0, with decreasing thickness. Contours at 0.1 and are dashed. 



of the Az/(1 + z) = 0.05 catalogue, however, H(cr) falls be- 
low the real-space result for r > 10 h~ x Mpc. This feature is 
due to the fact that the value of 7r max = 600 Mpc used 
for that catalogue gets close to the line-of-sight length of the 
simulation box used. As 7r max scales with Az, this issue does 
not affect the other catalogues. 

Our final result for the deprojected correlation function, 
£dep(r) obtained from the mock photometric catalogues is 
shown in Fig. 4, where we compare it to the real-space cor- 
relation function, (, r (r). 

To quantify the quality of the recovery, we used an 'av- 



erage normalized residual', A£, as figure of merit, defined 



?de P (r;) - £ r (ri) 



where n are the values of the bins in r where we measure £, 
and N is the number of such bins considered. 

Without prior knowledge of £ r (r) we could anyhow es- 
timate the quality of the recovery calculating the quantity: 
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Figure 4. Comparison between the deprojected correlation function, £dcp( r ) (open circles), and the real-space correlation function, £rM 
(solid circles), for each mock photometric catalogue. The error bars plotted correspond to the diagonal terms in the covariance matrix, 
C 1/2 . 



We show the values of A£ and A£ obtained for the dif- 
ferent mock catalogues in Table 1. We computed them for 
different ranges in r, in order to assess the validity of the 
method at different scales. From the values of A£ we see that 
we recover £(r) within a 5 per cent in the average for scales 
r < lOftT 1 Mpc for mock catalogues with Az < 0.015(1 + 2:). 
At larger scales, the deviations from £ r are larger (12 — 20 per 
cent). In the case with the largest redshift errors, our method 
is only valid for very small scales, r < 2 h~ x Mpc, where the 



deviations are of a 7 per cent. We note that, in all cases 
where the method is valid, A£ > A£. Hence, the jackknifc 
method allows us to estimate the errors to an acceptable 
precision. We remark, however, that for large values of Az 
the jackknife error underestimates the real one as measured 
from the residuals or compared to other Az values, specially 
over medium scales (2 — 20 h~ x Mpc). 

Fig. 5 shows the covariance matrix for the real-space 
calculation of (, r (r), and Fig. 6 shows it for the calculation of 
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Table X. Values of A£ and A£ obtained for the three mock photometric catalogues and for different scale ranges. 
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Figure 3. The projected correlation function obtained from the 
mock photometric catalogues, compared to the real-space result. 
The solid line corresponds to a spline fit to 5r(f), as explained 
in the text. Small shifts have been applied along the a axis, for 
clarity. The feature observed at large scales for the Az = 0.05(1 + 
z) catalogue is due to the large 7r ma x value used in that case. 



£de P (y) in each case. As the absolute values of the covariance 
drop rapidly with distance, we show here the normalized 
covariances 



Ci 



While the absolute values of the covariances grow with 
the redshift error, and are always larger than the covariances 
for the real-space correlation function, the structure of the 
covariance matrix is different. While the real-space covari- 
ances are large for almost all bin pairs, the covariance ma- 
trices for photometric correlation functions are much closer 
to diagonal. This is similar to the fact that the best esti- 
mates of the correlation function are obtained integrating 
over line-of-sight distances, even from spectroscopic redshift 
catalogues (see, e.g., Zehavi et al. (2005a)). 

In order to assess the robustness of the method to the 
presence of 'catastrophic' redshift determinations, we re- 
peated the calculation in catalogues containing 5 per cent of 
such outliers. Outliers were created selecting points at ran- 
dom in the original catalogue, and assigning them a random 
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Figure 5. The normalized covariance matrix (Cjw) of the corre- 
lation function measured directly from the real-space catalogue. 



distance within the range considered. Even with the conser- 
vative assumption of a large fraction of outliers, our method 
recovers £(»"), although the values of A£ are slightly larger in 
this case, ranging from 5 to 13 per cent for r < 10ft _1 Mpc 
and Az sC 0.015(l + z). The result obtained for the catalogue 
with Az = 0.015(1 + z), and containing 5 per cent of out- 
liers is shown in Fig. 7. Similar results were obtained when 
a fraction of the outliers were taken from a Poisson sample 
within our volume. This case would reproduce the effect of 
stellar contamination in the catalogue. 

We performed an additional test of the deprojection 
method using a realization of a segment Cox process, for 
which an analytical expression of the correlation function is 
known (Martinez et al. 1998). The results of those tests are 
further explained in Appendix A. 



5 CONCLUSIONS 

We have shown the reliability of recovering the real-space 
two-point correlation function from photometric redshift 
surveys. We have used light-cone simulations to produce 
mock catalogues that have been distorted by randomizing 
along the line of sight the object positions following Gaus- 
sian distributions with different variances similar to the as- 
sociated nominal errors of the photometric redshift surveys 
Az/(l + z). 
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Figure 6. The normalized covariance matrices (Cjw) of the deprojected correlation function measured directly from the mock photometric 
catalogues with Az = 0.005(1 + z) (top left), Az = 0.015(1 + z) (top right) and Az = 0.05(1 + z) (bottom). 



The method used to recover the real-space correlation 
function consists in obtaining the projected correlation func- 
tion by integrating the two-dimensional correlation function 
along the line of sight. The projected correlation function is 
then deprojected assuming that redshift errors do not affect 
transverse distances. 

The deprojection method applied on the distorted mock 
surveys provides quite satisfactory results for recovering the 
real-space correlation function. We have quantified the qual- 
ity of the recovering process as a function of the errors in 
the photometric redshifts. Our method was able to recover 
the real-space correlation function within a 5 per cent for 
r < 10 h" 1 Mpc from photometric catalogues with Az ^ 
0.015(1 + z). For larger redshift errors, the method is only 
valid (within a 7 per cent) for smaller scales, r < 2 Mpc. 
Hence, our method allows the extraction of useful informa- 
tion on the clustering of galaxies through the correlation 
function. That information can be used for the estimation 



of cosmological parameters based on data from photometric 
redshift surveys. 

We discuss now possible alternatives to the method ex- 
posed here. A variation of the method would be to use a 
smaller value of 7r max in the integration of equation (3), 
and multiply the result by a constant correction factor. This 
would reduce the extra noise introduced by the integration 
along a large range in the it direction. We found that, for 
7r max ~ Az, a correction factor of ~ 2 works well in our sim- 
ulation, generally reducing the error. However, the optimal 
value is slightly different for each mock photometric cata- 
logue. The main problem for the use of this method would 
be the accurate determination of the correction factor in 
each case, as any deviation from the optimal value would 
introduce a bias in the result. The Gaussian approximation 
used here is probably not so close to reality as to infer that 
constant from our simulations. 

Another possible alternative to the method described in 
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Figure 7. The deprojected correlation function obtained for the 
Az = 0.015(1 + z) catalogue with and without outliers. The con- 
tinuous line is the real-space correlation function. 



this work could make use of the new estimator u> presented in 
Padmanabhan, White & Eisenstein (2007). They proposed 
lo as an alternative to the projected correlation function to 
use with spectroscopic survey data. In principle, it can also 
be applied to photometric redshift survey data, in a way 
similar to the one presented here, but we did not investigate 
further this possibility. 
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APPENDIX A: TEST OF THE METHOD 
USING A SEGMENT COX PROCESS 

In order to test our method against an analytical prediction 
for £(r), we used a distribution of points given by a segment 
Cox process. This process is produced in the following way 
(Martinez & Saar 2002): segments of a given length, I, are 
randomly scattered within a volume. Then, points are ran- 
domly distributed along these segments. The lenght density 
of the system of segments is Ly — X 3 l, where A s is the mean 
number of segments per unit volume. The density of the 
point process is then 

A = XiLv = A; \ s l , 

where A; is the mean number of points per unit length of 
the segments. The correlation function of the point process 
equals the correlation function of the system of segments 
(Stoyan, Kendall & Mecke 1995), which is given by 
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£cox(r) = 
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2irrlL v ' 
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r < / 
r > I 



(Al) 



We simulated a segment Cox process in the same vol- 
ume considered in the rest of this work. The parameters 
we used were I = 50 hT l Mpc, A s = 2 ■ 1CT 4 h 3 Mpc^ 3 and 
A; = 4 ft Mpc -1 , which result in L v = 0.01 ft 2 Mpc -2 and 
A = 0.04 ft 3 Mpc -3 . These parameters were chosen to ap- 
proximately match the density of points and the behaviour 
of £(r) in the haloes simulation. We considered the cata- 
logue obtained directly from the segment Cox process as the 
'real-space' catalogue. We created three mock 'photometric 
redshift catalogues' following the same procedure and using 
the same values for Az as described in Section 2 . 

We calculated directly the correlation function for the 
real-space catalogue according to equation (1). For the three 
mock 'photometric catalogues', we used the method de- 
scribed in Section 3 to obtain the deprojected correlation 
function. The estimation of errors was performed using the 
same jackknife method as described above for the haloes 
simulation case. We checked that the errors obtained were 
comparable to the variance of the results from several real- 
izations of the Cox process. The comparison of our results 
to the analytical prediction (Al) is shown in Fig. Al. 

We quantify the quality of the recovery in the same way 
as we did for the haloes simulation, using the quantities A£ 
and A£. In this case, we define A£ as the relative devia- 
tion of £de P (f) from the analytical prediction £cox(r) (equa- 
tion (Al)). The values obtained are shown in Table Al. 

We recover the real-space correlation function within 
a 10 per cent for the Az/(1 + z) = 0.005 catalogue. In this 
case, however, our method starts to fail at r ~ 3 — 4 ft -1 Mpc 
for the Az/(l + z) = 0.015 catalogue (this is seen as a larger 
value of A£ for this range, and as an increasing trend in 
Fig. Al). When applying the method to the Az/(1 + z) = 
0.05 catalogue, £de P (r) is consistently higher than £cox(r). 
Although this bias could be an artefact of this particular 
point process, it also means that the deprojection method 
described in this work can not be fully trusted when it is 
applied to catalogues with large redshift errors. 

This paper has been typeset from a TjrjX/ F/TfrjX file prepared 
by the author. 
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Figure Al. The correlation function measured in the real-space and the three mock 'photometric' catalogues obtained from a segment 
Cox process (open circles), compared to the analytical prediction, equation (Al), for this process (solid line). 



Table Al. Values of A£ and A£ obtained for the three mock photometric catalogues obtained from a Cox process, and for different 
scale ranges. 



Range (h^ 1 Mpc) 


Az 
(1+*) 

A? 


= 0.005 
A£ 


Az 
(1+*) 

A£ 


= 0.015 

a! 


Az 
(1+*) 

A? 


= 0.05 
A£ 


0.5 < r < 10 


0.08 


0.16 


0.18 


0.23 


0.35 


0.40 


0.5 < r < 2 


0.06 


0.08 


0.07 


0.12 


0.23 


0.21 


2 < r < 10 


0.10 


0.22 


0.27 


0.33 


0.46 


0.57 



